Supported File Types for Content Inspection and OCR

The following table lists file types recognized by Cyberhaven's content inspection services and shows whether each format supports text extraction (content scanning) and optical character recognition (OCR).

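| File Type | Content Scanning | OCR |
|---|---|---|
| js | Yes | No |
| jpg | No | Yes |
| py | Yes | No |
| png | No | Yes |
| pyc | No | No |
| ts | Yes | No |
| pdf | Yes | Yes |
| json | Yes | No |
| c | Yes | No |
| map | Yes | No |
| h | Yes | No |
| class | No | No |
| java | Yes | No |
| txt | Yes | No |
| html | Yes | No |
| pyi | Yes | No |
| jpeg | No | Yes |
| svg | No | Yes |
| md | Yes | No |
| heic | No | No |
| strings | Yes | No |
| mp4 | No | No |
| csv | Yes | No |
| tsx | Yes | No |
| dat | Yes | No |
| mov | No | No |
| xml | Yes | No |
| properties | Yes | No |
| xlsx | Yes | No |
| loopdata | No | No |
| ithmb | No | No |
| yaml | Yes | No |
| css | Yes | No |
| php | Yes | No |
| mjs | Yes | No |
| yml | Yes | No |
| jar | Yes | No |
| tsv | Yes | No |
| kt | Yes | No |
| zip | Yes | No |
| docx | Yes | No |
| plist | Yes | No |
| rb | Yes | No |
| ri | Yes | No |
| gif | No | Yes |
| log | Yes | No |
| wav | No | No |
| inc | Yes | No |
| mp3 | No | No |
| tif | No | Yes |
| sql | Yes | No |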
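
For scripting against the table above, for example to check in advance which files in a folder can be inspected, the entries can be captured in a simple lookup. The following is a minimal sketch, not part of any Cyberhaven API: the `SUPPORT` dictionary and the `inspection_support` function are hypothetical names introduced for illustration, and only a few rows of the table are copied in.

```python
from pathlib import PurePath

# Hypothetical lookup built by hand from a few rows of the table above.
# Each value is (content_scanning, ocr).
SUPPORT = {
    "pdf": (True, True),
    "docx": (True, False),
    "txt": (True, False),
    "png": (False, True),
    "jpg": (False, True),
    "heic": (False, False),
    "mp4": (False, False),
}


def inspection_support(filename):
    """Return (content_scanning, ocr) for a file name.

    Returns None when the extension is missing or not listed in SUPPORT.
    Matching is case-insensitive and uses only the final extension.
    """
    ext = PurePath(filename).suffix.lower().lstrip(".")
    return SUPPORT.get(ext)


if __name__ == "__main__":
    for name in ["Report.PDF", "photo.heic", "notes.xyz"]:
        print(name, inspection_support(name))
```

Returning `None` for unlisted extensions keeps "not in the table" distinct from "listed but unsupported" (`(False, False)`), which the table itself distinguishes for formats such as heic and mp4.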